Semi-supervised Learning of Utterances Using Hidden Vector State Language Model

Authors

Manzoor Ahmad Chachoo

M. K. Quadri

Abstract

A spoken dialogue system has an uncertain parameter during speech recognition that controls its performance and varies across users, and even for the same user across repetitions of the same dialogue. This paper discusses how recognition errors in users' utterances can be handled by applying semi-supervised learning techniques to the hidden vector state (HVS) model. The HVS model is an extension of the basic Markov model in which context is encoded in each state as a vector. State transitions in the HVS model are factored into stack shift operations similar to those of a push-down automaton. Being a statistical model, the HVS model requires a large amount of labeled training data, which is difficult to obtain in practice. In this paper we present how classification-based and expectation-maximization semi-supervised learning approaches can be trained on both labeled and unlabeled corpora to handle uncertainty from the user as well as recognition errors from the speech recognition system. The experimental results show that the proposed framework using the HVS model can improve the dialogue management performance of the spoken dialogue system compared with the baseline model.
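The stack shift operation described in the abstract can be illustrated with a minimal sketch. It assumes the usual HVS formulation in which each transition pops some number of labels from the state vector and then pushes exactly one new label. The function name `hvs_transition`, the depth limit `max_depth`, and the concept labels (`SS`, `FLIGHT`, `TOLOC`, `CITY`, `DATE`) are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from typing import Tuple

# A state is a stack of semantic concept labels, top of stack first.
State = Tuple[str, ...]


def hvs_transition(state: State, pop_count: int, new_concept: str,
                   max_depth: int = 4) -> State:
    """Apply one HVS-style transition: pop `pop_count` labels, then push one.

    `max_depth` is an assumed bound on the stack depth (illustrative only).
    """
    if not 0 <= pop_count <= len(state):
        raise ValueError("pop_count must be between 0 and the stack depth")
    shifted = state[pop_count:]            # stack shift: drop the top labels
    new_state = (new_concept,) + shifted   # push exactly one new concept
    if len(new_state) > max_depth:
        raise ValueError("stack depth exceeds max_depth")
    return new_state


# Hypothetical walk through a flight-booking utterance.
state: State = ("SS",)                       # sentence-start label
state = hvs_transition(state, 0, "FLIGHT")   # ("FLIGHT", "SS")
state = hvs_transition(state, 0, "TOLOC")    # ("TOLOC", "FLIGHT", "SS")
state = hvs_transition(state, 0, "CITY")     # ("CITY", "TOLOC", "FLIGHT", "SS")
state = hvs_transition(state, 2, "DATE")     # ("DATE", "FLIGHT", "SS")
```

In a statistical model of this kind, the pop count and the pushed label would each be drawn from a learned distribution conditioned on the previous state; the sketch fixes them by hand to show only the mechanics of the transition.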


Related resources

Semi-Supervised Transductive Speaker Identification

We present an application of transductive semi-supervised learning to the problem of speaker identification. Formulating this problem as one of transduction is the most natural choice in some scenarios, such as when annotating archived speech data. Experiments with the CHAINS corpus show that, using the basic MFCC-encoding of recorded utterances, a well known simple semi-supervised algorithm, l...


Combining active and semi-supervised learning for spoken language understanding

In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is ...


Semi-supervised learning of the hidden vector state model for extracting protein-protein interactions

OBJECTIVE: The hidden vector state (HVS) model is an extension of the basic discrete Markov model in which context is encoded as a stack-oriented state vector. It has been applied successfully for protein-protein interactions extraction. However, the HVS model, being a statistically based approach, requires large-scale annotated corpora in order to reliably estimate model parameters. This is nor...


Semi-supervised Learning for Spoken Language Understanding Using Semantic Role Labeling

In a goal-oriented spoken dialog system, the major aim of language understanding is to classify utterances into one or more of the pre-defined intents and extract the associated named entities. Typically, the intents are designed by a human expert according to the application domain. Furthermore, these systems are trained using large amounts of data manually labeled using an already prepared la...


Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

We propose a unified global entropy reduction maximization (GERM) framework for active learning and semi-supervised learning for speech recognition. Active learning aims to select a limited subset of utterances for transcribing from a large amount of un-transcribed utterances, while semi-supervised learning addresses the problem of selecting right transcriptions for un-transcribed utterances, s...



Publication date: 2012
